Bayesian Mixed Membership Models for Soft Classification

نویسندگان

  • Elena A. Erosheva
  • Stephen E. Fienberg
چکیده

The paper describes and applies a fully Bayesian approach to soft classification using mixed membership models. Our model structure has assumptions on four levels: population, subject, latent variable, and sampling scheme. Population level assumptions describe the general structure of the population that is common to all subjects. Subject level assumptions specify the distribution of observable responses given individual membership scores. Membership scores are usually unknown and hence we can also view them as latent variables, treating them as either fixed or random in the model. Finally, the last level of assumptions specifies the number of distinct observed characteristics and the number of replications for each characteristic. We illustrate the flexibility and utility of the general model through two applications using data from: (i) the National Long Term Care Survey where we explore disability classifications; (ii) semantic decompositions of abstracts and bibliographies from articles published in The Proceedings of the National Academy of Sciences. In the first application we use a Monte Carlo Markov chain implementation for sampling from the posterior distribution. In the second application, because of the size and complexity of the data base, we use a variational approximation to the posterior. We also include a guide to other applications of mixed membership modeling.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Bayesian Mixed Membership Models for Soft Clustering and Classification

The paper describes and applies a fully Bayesian approach to soft clustering and classification using mixed membership models. Our model structure has assumptions on four levels: population, subject, latent variable, and sampling scheme. Population level assumptions describe the general structure of the population that is common to all subjects. Subject level assumptions specify the distributio...

متن کامل

طبقه‌بندی زیرپیکسلی تصاویر ابرطیفی براساس تعمیم الگوریتم معاوضه پیکسلی و ارزیابی آن

The capability of the matter identification is developed considerably in hyperspectral images. The spectral reflectance of surfaces in these imaging systems in the visible and near infrared range of the electromagnetic spectrum is recorded in extremely narrow and continuous bands. But for some reasons, such as existence the mixed pixels and low spatial resolution of these images, is difficult t...

متن کامل

Mixed Membership Models for Time Series

20.1 Background . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 419 20.1.1 State-Space Models . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 419 20.1.2 Latent Dirichlet Allocation . . . . . . . . . . . ...

متن کامل

Fuzzy ARTMAP Based Neurocomputational Spatial Uncertainty Measures

This paper proposes non-parametric measures for the fuzzy ARTMAP computational neural network to handle spatial uncertainty in remotely sensed imagery classification, i.e., ART Commitment (ART-C) and ART Typicality (ART-T), expressing in the first case the degree of commitment a classifier has for each class for a specific pixel, and in the second case, how typical that pixel’s reflectances are...

متن کامل

Exponential family mixed membership models for soft clustering of multivariate data

For several years, model-based clustering methods have successfully tackled many of the challenges presented by data-analysts. However, as the scope of data analysis has evolved, some problems may be beyond the standard mixture model framework.One suchproblem iswhenobservations in a dataset come fromoverlapping clusters, whereby different clusters will possess similar parameters for multiple va...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2004